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y—i Abstract 

o 

^\i Our main result is that the Steiner Point Removal (SPR) problem can always be solved 

with polylogarithmic distortion, which resolves in the affirmative a question posed by Chan, 
Q^ Xia, Konjevod, and Richa (2006). Specifically, we prove that for every edge- weighted graph 

G = {V,E,w) and a subset of terminals T C V, there is a graph G" = (T,E',w') that is 
isomorphic to a minor of G, such that for every two terminals u,v G T, the shortest-path 
distances between them in G and in G' satisfy d(5„, (u, w) < dGi^yj'{u,v) < 0(log \T\)-dQyj{u,v). 
Our existence proof actually gives a randomized polynomial-time algorithm. 

Our proof features a new variant of metric decomposition. It is well-known that every finite 

metric space {X,d) admits a /3-separating decomposition for /3 = 0(log|X|), which roughly 

c/3 means for every desired diameter bound A > there is a randomized partitioning of X, which 

, ^, satisfies the following separation requirement: for every x,y G X, the probability they lie in 

different clusters of the partition is at most /3 d{x, y)/ A. We introduce an additional requirement, 

7—i which is the following tail bound: for every shortest-path P of length d{P) < A//3, the number 

^ of clusters of the partition that meet the path P, denoted Zp, satisfies Pr[Zp > t] < 2e~^'-*^ for 

2; aU i > 0. 
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1 Introduction 

Graph compression describes the transformation of a given graph G into a smah graph G' that 
preserves certain features (quantities) of G, such as distances or cut values. Notable examples 
for this genre include graph spanners, distance oracles, cut sparsifiers, and spectral sparsifiers, see 
e.g. |PS89| ITZ051 IBK961 IBSS09J and references therein. The algorithmic utility of such graph 
transformations is clear - once the "compressed" graph G' is computed (as a preprocessing step), 
further processing can be performed on G' instead of on G, using less resources like runtime and 



memory, or achieving better accuracy (when the solution is approximate). See more in Section 1.3 
Within this context, we study vertex- spar sification, where G has a designated subset of vertices 
T, and the goal is reduce the number of vertices in the graph while maintaining certain properties 
of T. A prime example for this genre is vertex-sparsifiers that preserve terminal versions of (mul- 
ticommodity) cut and flow problems, a successful direction that was initiated by Moitra |Moi09| 
and extended in several followups [LMlOl K^LLMlOl IMMini lEaK+inl I(^hul2j . Our focus here 
is different, on preserving distances, a direction that was seeded by Gupta |Gup01| more than a 
decade ago. 

Steiner Point Removal (SPR). Let G = {V,E,w) be an edge- weighted graplrj and let T = 

{ti, . . . ,tk} C y be a designated set of k terminals. Here and throughout, dc^wi', •) denotes the 
shortest-path metric induced by w on the vertices of G. The Steiner Point Removal problem asks 
to construct on the terminals a new graph G' = {T,E',w') such that (i) distances between the 
terminals are distorted at most by factor a > 1, formally 

Vn, f E T, dG,wiu,v) < dG',w'iu,v) < a ■ dG,w{u,v); 

and (ii) the graph G' is (isomorphic to) a minor of G. This formulation of the SPR problem was 
proposed by Chan, Xia, Konjevod, and Richa |CXKR06l Section 5] who posed the problem of 
bounding the distortion a (existentially and/or using an efficient algorithm). Our main result is to 
resolve their open question. 

Requirement (ii) above expresses structural similarity between G and G'; for instance, if G is 
planar then so is G". The SPR formulation above actually came about as a generalization to a 
result of Gupta |Gup01| , which achieves a = 8 for the case where G is a treePJand this factor of 8 
was proved by [CXKR06] to be tight. The upper bound for trees was later extended by Basu and 
Gupta [BG08J . who achieve distortion a = 0(1) for the larger class of outerplanar graphs. 

How to construct minors. We now describe a general methodology that is natural for the SPR 
problem. The first step constructs a minor G' with vertex set T, but without any edge weights, 
and is prescribed by Definition |1.1[ The second step determines edge weights w' such that dc',?. 



dominates d^w on the terminals T, and is given in Definition 1.2 These steps are illustrated 
in Figure fl} Our definitions are actually more general (anticipating the technical sections), and 
consider G' whose vertex set is sandwiched between T and V. Define a partial partition of V to be 
a collection Vi, . . . , V^ of pairwise disjoint subsets of V. 



^Throughout, all graphs are undirected and all edge weights are positive. 

^In fact, Gupta |Gup01| only argued that G' is a tree. Chan et al. [CXKR06] observed later that this same G' 
is actually a minor of G. 




Figure 1: G is a 9-cycle with 3 terminals and unit edge weights. Its terminal-centered minor and 
the standard restriction edge weights are shown on the right. 



Definition 1.1 (Terminal-Centered Minor). Let G = {V,E) be a graph with k terminals T = 
{ii, . . . , tk}, and letVi,...,Vk be a partial partition of V , such that each induced subgraph G[Vj] is 
connected and contains tj. The graph G' = (V' , E') obtained by contracting each G\Vj\ into a single 
vertex that is identified with tj, is called the terminal-centered minor of G induced by Vi, . . . ,Vk- 

By identifying the "contracted super- node" Vj with tj, we may think of the vertex-set V' as 
containing T and (possibly) some vertices from V\T. K terminal-centered minor G' of G can also 
be described by a mapping / : F — )• T U {-L}, such that f\T = id and /~^({tj}) is connected in G 
for ah j G [k]. Indeed, simply let Vj = f-^{{tj}) for all j E [k], and thus V \ {UjVj) = f-^{{l.}). 

Definition 1.2 (Standard Restriction). Let G = {V, E, w) be an edge-weighted graph with terminals 
set T, and let G' = {V',E') be a terminal- centered minor of G. The standard restriction of w to 
G' is the edge weight w' given by the respective distances in G, formally 

V(x,y) G E', w'^y ■= dG,w{x,y). 

This edge weight w' is optimal in the sense that dc^w' dominates dc,™ (where it is defined, i.e., 
on y), and the weight of each edge (x,y) E E' is minimal under this domination condition. 

1.1 Main Result 

Our main result below gives an efficient algorithm that achieves polylog(A;) distortion for the SPR 
problem. Its proof spans Sections [3] and [4j though the former one contains the heart of the matter. 

Theorem 1.3. Let G = (y,E,w) be an edge-weighted graph with k terminals T C y. Then there 
exists a terminal- centered minor G' = {T,E',vj') of G, such that G' has distortion 0(log k), i.e., 

Vn, V e T, 1 < < C(log k). 

dG,w[u,v) 

Moreover, w' is the standard restriction of w, and G' is computable in randomized polynomial time. 



This theorem answers a question of [CXKROG ] r] The only distortion lower bound known for 
general graphs is a factor of 8 (which actually holds for trees) [CXKROGJ . and thus it remains a 
challenging open question whether 0(1) distortion can be achieved in general graphs. 

Our proof of Theorem 1.3 begins similarly to [EGK"'"l()j . by iterating over the "distance scales" 



2*, going from the smallest distance dG,w{u,v) among all terminals u,v G T, towards the largest 
such distance. Each iteration i first employs a "stochastic decomposition", which is basically a 
randomized procedure that partitions V into so-called "clusters" (disjoint subsets) whose diameter 
is at most 2*. Then, some clusters are contracted to a nearby terminal, which must be "adjacent" 
to the cluster; this way, the current graph is a minor of the previous iteration's graph, and thus 
also of the initial G. Let us now skip some technical details about the connectivity of the clusters, 
which is important for the contractions to be legit, and just say that at a high level, after iteration 
i is executed, we roughly expect "areas" of radius proportional to 2* around the terminals to be 
contracted. As i increases, these areas get larger until eventually all the vertices are contracted 
into terminals, at which point the weights are set to be standard restriction. 

The main challenge is to control the distortion, and this is where we crucially deviate from 
|EGK"'"10] (and differ from all previous work). In their randomized construction of a minor G' , 
for every two terminals u,v £ T it is shown that G' contains a u — v path of expected length at 
most 0(logk)dG{u,v). Consequently, they design a distribution D over minors G' , such that the 
stretch dG'{u,v)/dG{u,v) between any u,v £ T has expectation at most 0(logA;)|^ In contrast, 
in our randomized construction of G' , the stretch between u,v G T is polylogarithmic with high 
probability, say at least 1 — 1/k^. Applying a simple union bound over the (2) terminal pairs, 
we can then obtain a single graph G' achieving a polylogarithmic distortion. Technically, these 
bounds follow by fixing in G a shortest-path P between two terminals u,v € T, and then tracking 
the execution of the randomized algorithm to analyze how the P evolves into a u — v path P' in 
G'. In [EGK^IOJ . the length of P' is analyzed in expectation, which by linearity of expectation, 
follows from analyzing a single edge. In contrast, we provide for it a high-probability bound, which 
inevitably must consider (anti)correlations along the path. 

The next section features a new tool that we developed in our quest for high-probability bounds, 
and which may be of independent interest. For sake of clarity, we provide below a vanilla version 
that excludes technical complications such as terminals, strong diameter, and consistency between 
scales. The proof of Theorem |1.3| actually does require these complications, and thus cannot use 
the generic form described below. 

1.2 A Key Technique: Metric Decomposition with Concentration 

Metric decomposition. Let {X, d) be a metric space, and let 11 be a partition of X. Every 5 G 11 
is called a cluster, and for every x £ X, we use n(x) to denote the unique cluster 5 G 11 such that 
X G S. In general, a stochastic decomposition of the metric {X, d) is a distribution /x over partitions 
of X, although we usually impose additional requirements. The following definition is perhaps the 
most basic version, often called a separating decomposition or a Lipschitz decomposition. 

Definition 1.4. A metric space {X,d) is called /3-decomposable if for every A > there is a 
probability distribution /i over partitions of X, satisfying the following requirements: 



^We remark that distortion k can be achieved relatively easily, by considering a mapping / that sends every vertex 
into its nearest terminal, and taking the minor corresponding to / with standard-restriction edge weights. 
*But it is possible that no G' € supp(Z)) achieves a low stretch simultaneously for all u,v € T. 



(a). Diameter bound: for all U £ supp(/x) and all 5 € 11, diani(5') < A. 



(b). Separation probability: for all x,y £ X, Pr [11(2;) 7^ n(y)] < 



I3d{x,y) 



Bartal |Bar96j proved that every n-point metric is 0(logn)-decomposable, and that this bound 
is tight. We remark that by now there is a rich hterature on metric decompositions, and different 
variants of this notion may involve terminals, or (in a graphical context) connectivity require- 
ments inside each cluster, see e.g. |Bar96l K^KROIL IFETn4[ IBarn4[ iLNOBl K^tNRini lEaK+ini IMNOTI 
lAGMWlOLlKRTT] . 

Degree of separation. Let P = (xo,xi, . . . ,X£) be a shortest path between xo,Xi £ X, i.e., a 
sequence of points in X such that J2ie\i] '^(^«-i>^i) = d{xQ,Xi). We denote its length by d{P) := 
d{xo, xi), and say that P meets a cluster S^XifSCiP^^. Given a partition 11 of X, define 
the degree of separation Zpili) as the number of diff'erent clusters in the partition 11 that meet P. 
Formally 

Zp(n) := 2^ l{p meets 5}- (1) 

Sen 

Throughout, we omit the partition 11 when it is clear from the context. When we consider a 
random partition 11 ~ /i, the corresponding Zp = Zpiji) is actually a random variable. If this 
distribution ^ satisfies requirement | (b)\ of Definition 1.4, then 



E [Zp] < 1 + E P^ P(x,_0 / n(x.)] < 1 + J: M:^il-llM = 1 + M^. (2) 

But what about the concentration of Zpl More precisely, can every finite metric be decomposed, 
such that every shortest path P admits a tail bound on its degree of separation Zpl 

A tail bound. We answer this last question in the affirmative using following theorem. We prove 
it, or actually a stronger version that does involve terminals, in Section [2] 

Theorem 1.5. For every n-point metric space {X^d) and every A > there is a probability 



distribution fi over partitions of X that satisfies, for (3 = O(logn), requirements (a) (b) of Definition 
\l-4\ o-nd furthermore 

(c). Degree of separation: For every shortest path P of length d{P) < -g-, 

Vt > 1, Pr [Zp >t]< 2e"^(*). (3) 

The tail bound O can be compared to a naive estimate that holds for every /3-decomposition 
/i: using ([2]) we have E[Zp] < 2, and then by Markov's inequality Pr[Zp >t]< 2/t. 



1.3 Related Work 

Applications. Vertex-sparsification, and the "graph compression" approach in general, is obvi- 
ously beneficial when G' can be computed from G very efficiently, say in linear time, and then G' may 
be computed on the fly rather than in advance. But compression may be valuable also in scenarios 
that require the storage of many graphs, like archiving and backups, or rely on low-throughput 
communication, like distributed or remote processing. For instance, the succinct nature of G' may 
be indispensable for computations performed frequently, say on a smartphone, with preprocessing 
done in advance on a powerful machine. 

We do not have new theoretical applications that leverage our SPR result, although we an- 
ticipate these will be found later. Either way, we believe this line of work will prove technically 
productive, and may influence, e.g., work on metric embeddings and on approximate min-cut/max- 
flow theorems. 

Probabilistic SPR. Here, the objective is not to find a single graph G' = (T, E\ w'), but rather 
a distribution D over graphs G' = {T,E' ,w'), such that every graph G' G supp(L') is isomorphic 
to a minor of G and its distances dcw' dominate dc^w (on T x T), and such that the distortion 
inequalities hold in expectation, that is, 

\/u,veT, E [dQi^^i{u,v)] < a ■ dcwiujv). 

This problem, first posed in [CXKROG] . was answered in JEGK"'"10j with a = 0(log \T\). 

Distance Preserving Minors. This problem differs from SPR in G' may contain a few non- 
terminals, but all terminal distances should be preserved exactly. Formally, the objective is to find 
a small graph G' = {V' ,E' ,w') such that (i) G' is isomorphic to a minor of G; (ii) T C y' C 1/; 
and (iii) for every u,v £ T, dG\w'iu,v) = dG,w{u,v). This problem was originally defined by 
Krauthgamer and Zondiner [KZ12J . who showed an upper bound \V'\ < 0{\T\^) for general graphs, 
and a lower bound of r2(|Tp) that holds even for planar graphs. 

2 Metric Decomposition with Concentration 



We now prove a slightly stronger result than that of Theorem 1.5, stated as Theorem 2.2, Let 
{X, d) be a metric space, and let {ti, . . . , tk} C X be a designated set of terminals. Recall that a 
partial partition II of X is a collection of pairwise disjoint subsets of X. For a shortest path P in 
X, define Zp = Zp{I[) using Eqn. ([l]), which is similar to before, except that now II is a partial 
partition. We first extend Definition |1.4[ 



Definition 2.1. We say that X is /3-terminal-decomposable with concentration if for every A > 
there is a probability distribution fi over partial partitions of X, satisfying the following properties. 

• Diameter Bound: For all II G supp(;u) and all S G II, diam{S) < A. 

• Separation Probability: For every x,y £ X, 

Pr [3S G n such that \S H {x,y}\ = I ] < ^^^^' ^^ . 



Terminal Cover: For every 11 E supp(/x), T C IJsen '^■ 

Degree of Separation: For every shortest path P and for every t > 1, 



Pr 



Zp>^max{d(P),A//3} 



< O min <^ kl3 



dipyr 

A 



e-f^(t) 



Theorem 2.2. Every finite metric space with k terminals is {4:\ogk)-terminal-decomposable with 
concentration. 

Define the truncated exponential with parameters A, A > 0, denoted Texp(A, A), to be distribu- 
tion given by the probabihty density fmiction g\^A{x) = .,-,_ -a/a-i ^" foi^ ^^ ^ ^ [Oj ^)- 



We are now ready to prove Theorem 2.2 For simphcity of notation, we prove the result with 
cluster diameter at most 2 A instead of A. Fix a desired diameter bound A > 0, and set for the 
rest of the proof A := j^^— ^ and g := ^a.a- For x ^ X and r > 0, we use the standard notation 
of a closed ball B{x,r) := {y £ X : d{x,y) < r}. We define the distribution fi via the following 
procedure that samples a partial partition 11 of X. 



1: for j = 1,2, . . . ,fc do 

2: choose independently at random Rj ~ Texp(A, A), and let Bj = B{tj, Rj 

3: set Sj = Bj \ [j^2, Bm- 

4: return H = {5i, . . . , 5^} \ {0}. 



The diameter bound and terminal partition properties hold by construction. The proof of the 
separation event property is identical to the one in jBar96t Section 3]. The following two lemmas 



prove the degree of separation property, which will conclude the proof of Theorem 2.2 Fix a 
shortest path P in X, and let us assume that t/2 is a positive integer; a general t > 1 can be 
reduced to this case up to a loss in the unspecified constant. 



Lemma 2.3. If d{P) < A, then Pi[Zp > t] < 2e 



-n{t) 



Proof. Split the k terminals into Jfar := {j S [k] : d{tj, P) > A — 2A} and Jnear := [k] \ Jfar- Define 
random variables Zfar := #{j G Jfar : -Bj Pi P 7^ 0} and Znear := #{j S Jnear : Sj Ci P ^ 0}. Then 

Zp < ^far + ^ncar and 

Pr[Zp > t] < Pr[Zfar + Zncar > t] < Pr[Zfar > t/2] + Pr[Znear > t/2]. 

For every j G Jfar, 

Fr[Bj n P / 0] < Pi[Rj > A - 2A] = / g{x)dx = - — ^(e"^^ - e"^) < -, 

Ja-2\ k — 1 k 

and therefore E[Zfar] < 8. Since Zfar is the sum of independent indicators, by a Chernoff bound 
Pr[Zfar > t/2] < 2-*/2 fop ah t > 32e. For smaller t, observe that Pr[Zfar = 0] > (1 - 8/A;)^ > n{l), 
and thus for every t > 1 we have Pr[Zfar > t/2] < e~^'*\ 

Next, consider which balls among {Bj : j £ Jnear} have non-empty intersection with P, and let 
Ji < J2 < ■ ■ ■ denote their indices. Formally, we place a conditioning henceforth on a some event £ 



that determines whether Rj > d{tj, P) occurs or not for each j G Jncar- For a = 1,2,..., let Ya be 
an indicator for the event that ball Bj^ does not contain P. Then 



Pr[y, = 1 I £:] = Pr P^B.J PnB,^ / 



<Pr 



R,<d{tj^,P)+X\R,>d{t,^,P) 



< 



< 



Having conditioned on £, the event {Z^cslt > i/2} implies that Ya = 1 for all a £ [t/2], and since 
these random variables Ya are independent, Pr[Zncar > t/2 \ £] < (3/4)*' ^ < e"*^* for an appropriate 
constant C > 0. The last inequality holds for all such events £ (with the same constant C > 0), 
and thus also without any such conditioning. 

Altogether, we conclude that Pr[Zp > t] < 2e"'^(*). D 



Lemma 2.4. If d{P) > X, then Pt[Zp > td{P)/\] < O ('minjfclogfc, ^ }) e 



-n(t) 



Proof. Treating P as a continuous path, subdivide it into r := \d{P)/X\ segments, say segments 
of equal length that are (except for the last one) half open and half closed. The induced subpaths 
Pi,. . . ,Pr of P are disjoint (as subsets of X) and have length at most A each, though some of 
subpaths may contain only one or even zero points of X. Writing Zp = X^jgM Zp-, we can apply 
a union bound and then Lemma 12.31 on each Pi, to obtain 



Pr[Zp > td{P)/\] < Pr 3i G [r] such that Zp^ > t/2 



<o 



■d{P) 



-n(t) 



Furthermore, for every j G [k], let Aj := {i £ [r] : Pi Ci B{tj,A) ^ 0}, and since P is a shortest 
path, l^jl < 4A/A = 41ogA;. Observe that Zp. = (with certainty) for all i ^ Ujv4j, hence 

Fr[Zp > td{P)/\] < Pr 3i G ^j(.[k]Aj such that Zp^ > t/2 < 4/clog fce"^^*) . 

D 



3 Terminal- Centered Minors: Main Construction 



This section proves Theorem 
Assumption 3.1. 2? < 2^^. 



1.3 



when V := ".^^"'"^^ ■'^;"' ^ satisfies the following assumption. 

mm,, ,,aT drAu,v) o t- 



By scaling all edge weights, we may further assume that miiiu^veT dciu, v) = 1. 

Notation 1. Let Vi,...,Vk ^ V. For S C [k], denote Vs := UieS^- -^'^ addition, denote 
V±:=V\ Vik] and V±+j := V± U Vj for all j G [k]. 

We now present a randomized algorithm that, given a graph G = (V, E, w) and terminals T dV , 



constructs a terminal-centered minor G' as stated in Theorem 1.3, This algorithm maintains a 
partial partition {Vi, V2, . . . , V^} of V, starting with Vj = {tj} for all j G [k]. The sets grow 
monotonically during the execution of the algorithm. We may also think of the algorithm as if 
it maintains a mapping / : F — t- T U {J-}, starting with /|t = id and gradually assigning a 
value in T to additional vertices, which correspond to the set Vrw. Thus, we will also refer to the 
vertices in Vr;,] as assigned, and to vertices in Vj_ as unassigned. The heart of the algorithm is 



7 



two nested loops (lines E]l9]) . During every iteration of the outer loop, the inner loop performs k 
iterations, one for every terminal tj. Every inner- loop iteration picks a random radius (from an 
exponential distribution) and "grows" Vj to that radius (but without overlapping any other set). 
Every outer-loop iteration increases the expectation of the radius distribution. Eventually, all nodes 
are assigned, i.e. {Vi, V2, ■ ■ ■ , Vk} is a partition of V. 

Definition 3.2. Let U '^V, by G[U] we denote the subgraph of G induced by U, with induced edge 
lengths (i.e. wI^j^qiij])). For a subgraph H of G with induced edge lengths, a vertex v G V{H) and 
r > 0, denote B}{{v,r) := {u £ y{H) : dniujv) < r}, where dfj is the shortest path metric in H 
induced by w. 

Input: G= {V,E,w), T = {ti, . . . ,tk} cy 
Output: A partition {Vi, V2, • • • , Vk} of V. 
1: set b^ 1 + 1/(35 log fc) 

for every j G [k] set Vj- ■<— {tj}, r^ = 0. 
set i -^ 0. 1 1 i is the iteration number of the outer loop. 
while Vjjt] 7^ V' do 
i ■(^ i + l. 
for all j G [k] do 

choose independently at random i?*- ~ exp(6*). 



r,+R^. 



Vj- -^ Vj- U Bq^y j{tj,rj). 11 Actually, this is the same as Vj ^ Bq^y ■^{tj,rj). 
return {Vi,V2, . . . ,Vk}. 



Algorithm 1: Partitioning V 

Claim 3.3. The following properties hold throughout the execution of the algorithm. 

1. For all j G [k], Vj is connected in G, and tj G Vj. 

2. For every ji, j2 G [k], if ji / J2, then Vj^ fl Vj^ = 0. 

3. For every outer loop iteration i and every j G [k], ifV- denotes the set Vj in the beginning of 
the ith iteration (of the outer loop), and V." denotes the set Vj at the end of that iteration, 
then V^ C Vf. 

In what follows, we analyze the stretch in distance between a fixed pair of terminals. We show 
that with probability at least 1 — l/k^, the distance between these terminals in G' is at most 
0(log k) times their distance in G. By a union bound over all (2) pairs of terminals, we deduce 
Theorem 1 1.3[ Let s,t GT, and let P* be a shortest st-path in G. Due to the triangle inequality, 
we may focus on pairs which satisfy V{P*) CiT = {s,t}, where V{P*) is the node set of P*. We 
denote i := w{P*) = dermis, t). 

3.1 High Level Analysis 

Following an execution of the algorithm, we maintain a (dynamic) path P between s and t. In a 
sense, in every step of the algorithm, P simulates an st-path in the terminal-centered minor induced 
by Vi, V2, . . . , Vfc. In the beginning of the execution, set P to be simply P* . During the course 



of the execution update P to satisfy two invariants: At every step of the algorithm, the weight of 
P is an upper bound on the distance between s and t in the terminal centered minor induced by 
V\, . . . ,Vk (in that step). In addition, if / is a subpath of P, whose inner vertices are all unassigned, 
then I is a subpath of P* . Throughout the analysis, we think of P as directed from s to i, thus 
inducing a linear ordering of the vertices in P. 

Definition 3.4. A subpath of P will be called active if it is a maximal subpath whose inner vertices 
are unassigned. 

We now describe how P is updated during the execution of the algorithm. Consider line [9] 
of the algorithm for the ith iteration of the outer loop, and some j G [k] . We say that the ball 
B = Bqiy^ .1 {tj,rj) punctures an active subpath A of P, if there is an inner node of A that belongs 
to the ball. If B does not puncture any active subpath of P, we do not change P. Otherwise, 
denote by n, v the first and last unassigned nodes (possibly not in the same active subpath) in 
V{P) n B respectively. Then we do the following: 

1. We replace the entire subpath of P between u and v with a concatenation of a shortest 
utj-path. and a shortest tjV-path. that lie in B; this is possible, since G[B] is connected, and 
u,tj,v G B. This addition to P will be called a detour from u to v through tj. The process is 



illustrated in figures 2(a) 2(c) Beginning with P* , the figure describes the update after the 
first four balls. Note that the detour might not be a simple path. It is also worth noting that 
here u and v may belong to different active subpaths of P. For example, in figure 2(c)[ the 



new ball punctures two active subpaths, and therefore in figure 2(d) , the detour goes from a 
node in one active subpath to a node in another active subpath. Note that in this case, we 
remove from P portions which are not active. 

2. If, for some iteration i' < i of the outer loop, and for some u',v' £ V{P), we added a detour 
from u' to v' through tj in iteration i' , we keep only one detour through tj, from the first node 



between u,u' and to the last between v,v'. For example, in figure 2(d), the ball centered in 



fs punctures an active subpath. Only one detour is kept in figure 2(e) 



The total weight of all detours during the execution will be called the additional weight to P 
(ignoring portions of P that are deleted from P). Denote the set of active subpaths of P in the 
beginning of the ith iteration of the outer loop by Ai. 

Let Vi^, . . . , V/*" be the partition returned by the algorithm, let G' the terminal-centered 
minor induced by that partition, and let w' be the standard restriction of w to G'. Denote by pf^^ 
the path obtained at the end of the execution. 

Claim 3.5. At every step of the algorithm the following holds: 

1. The weight of P is an upper bound on the distance between s and t in the terminal centered 
minor induced by Vi,...,Vfc. Moreover, once Ai = ^ (namely, P has no active subpaths), 
the additional weight of P is an upper bound on the distance between s and t in the terminal 
centered minor induced by V/^^, . . . , V^ (actually, from this point on, P = P-f^"-. 

2. If A is a subpath of P, whose inner points are all in V±, then A is a subpath of P* . 

3. If Ai,A2 are two different active subpaths of P, they are internally disjoint. 
4- \-^i\ ^ ^ for 0,11 i. 
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t5 



t = t2 \ 



m.ri) 



B = tl\ 



i3 







B(t2,r2) 



t = ti 



(a) We begin with P = P* 



(b) In every iteration of the inner loop, (c) Replacing a portion of P with a de- 
we grow a terminal-centered ball tour. 




(d) Endpoints of a detour can belong 
to different active subpaths. 



(e) Joining detours. 



Figure 2: Updating P 



Proof. Follows easily by induction on i,j. 
Corollary 3.6. dG',n,'{s,t) < w{Pf^'^). 



D 



Let A & Ai- During the execution of the inner loop, A is either removed from P entirely, 
or some subpaths of A remain active (perhaps A remains active entirely). Therefore, for every 
A' G ^i+i, either A' is a non-trivial subpath of A (by non-trivial we mean |y(^')| > 3), or A' and 
A are internally disjoint. Therefore there is a laminar structure on |Jj-^i- 

We describe this structure using a tree T, whose node set is {{i,A)\ A G Ai}. The root of T 
is (1,P*), and for every i and every A G Ai, the children of {i,A), if any, are all pairs {i + 1,^4'), 
where A' G Ai+i is a subpath of A. Whenever we update P we log the weight of the detour by 
charging it to one of the nodes of T as follows. Let u, v be unassigned nodes in P, and consider a 
detour from u to u in the ith. iteration of the outer loop for some i. The weight of the detour is 
charged to the (unique) active subpath A £ Ai such that u £ A. For every i and A £ Ai, denote 
the total weight charged to {i, A) by Wi^A (If the node is never charged, the weight of the node is 
set to 0). Therefore, 



w 



(P^*") < w{P* 






(4) 



Equation Q together with Corollary 3.6 imply that if we show that with high probability, the total 
weight charged to the tree is at most 0(log k) • £, we can deduce Theorem |l.3[ For the rest of this 
section, we therefore prove the following lemma. 
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Lemma 3.7. With probability at least 1 — 0{k ^), the total weight charged to the tree is at most 
0(log6 k)e. 

Let p = 1/100. An active subpath A £ Ai will be called short if w{A) < pU. Otherwise, A will 
be called long (Note that P* € ^i is long). 

Definition 3.8. Let i> 1. Let A £ Ai be a short subpath. Denote by 7i,A the subtree of T rooted 
in {i,A). If, in addition, the parent of {i,A) in T consists of a long subpath, Ti^A will be called a 
short subtree ofT- 

3.2 The Behavior of Short Subtrees 

Clearly, every node of a short subtree of T consists of a short active subpath. The height and 
weight of short subtrees will play an important role in the analysis of the height and weight of T, 
and therefore we begin by analyzing short subtrees. Let io > 1. Let A G Ai^ be a short subpath, 
and assume in addition that H^^a is a short subtree of T. 

3.2.1 The Effect of a Single Ball 

Fix some i > io, and some j € [k]. Let X denote the number of active subpaths of A in the 
beginning of the jth iteration of the inner loop (during the ith iteration of the outer loop). Let X' 
denote the number of active subpaths of A in the end of the jth. iteration. Denote by B the ball 
considered in this iteration, namely B := S^ry^ i(tj,rj). 

Proposition 3.9. With certainty, X' < X + I. 

Proof. Let Ai,A2,..., Ax be the subpaths of A which are active in the beginning of the jth iteration 
ordered by their location on P. For a £ [X] denote by Ua,Va the first and last unassigned nodes 
in Aa, respectively. If B does not puncture any of these subpaths, then X' < X < X + 1 (Note 
that subpaths of A can still be removed if B punctures active subpaths of P not contained in A). 
So assume B punctures A^. Assume first that A^ is the only subpath of P which is active and 
is punctured by B. Then there are three options: If both Ua,Va £ B, then A^ is replaced and 
removed entirely from P when adding the detour, and X'<X — 1<X + 1. If Ua £ B and Va ^ B, 
let v' be the last node in V{Aa) H B then the Uav' segment of A^ is replaced, and the segment 
v'va remains active. Therefore X' < X < X + 1. The argument is similar, if Ua ^ B and Va £ B. 
Otherwise, some of the inner portion of A is replaced by a non active path, and both end segments 
of Aa remain active, therefore X' = X + 1. Next, assume the ball punctures several active subpaths 
of A, and maybe more subpaths of P. Denote by 1^, Ip the first and last subpaths of A punctured 
by B. Denote by u the first node in V{Ict)r\B, and v the last node in V{Ip)r\B. When updating P, 
the entire subpath of P between u and v is removed. Thus X'<X — {l3 — a + l)<X<X + l. D 

For the next proposition, we need the following definition, which is a generalization of the notion 
of a ball puncturing an active subpath. Recall that a ball B is said to puncture an active subpath 
A' if there is an inner node in A' that belongs to the ball. 

Definition 3.10. Let A' be an active subpath of P. We say that a ball B reaches A' ifV{A')r\B / 0. 

By the definition of the balls in the algorithm, B may reach A' and not puncture if and only if 
A' has one endpoint in Vj. 
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Proposition 3.11. 

Pr[X' > X \ B reaches an active subpath of A] < p . 

Proof. Let A' be an active subpath of A closest to tj. Let d = (Iq^Vj^^ ]itj , A') = dQ[Vj_^ ■]itj , A) be 
the distance in G[Vj_+j] between tj and a nearest node in A' . Assume that the ball B punctures 
an active subpath of A. Following the analysis of the previous proof, if X' > X, then B punctures 
exactly one active subpath of A and does not cover it, or B punctures exactly two active such 
subpaths and does not cover both of them. In either case, B punctures A' and does not cover it, 
implying that rj + i?* > d, and rj -\- BJ"- < d + pU (recall that A is short, and therefore, so is A'). 
If B reaches an active subpath of A, and does not puncture it, we get the same conclusion, since 
this means rj + i?*- > rj > d. By the memoryless property, 

Pr[X' > X \ B reaches an active subpath of A\ < 

< Fi[E^j <d-rj+ pb' \Ri>d- rj] < 1 - e^^ < p 

D 

3.2.2 The Effect of a Sequence of Balls 

Consider the first A^ balls that reach some active subpath of A, starting from the beginning of 
iteration Iq of the outer loop, and perhaps during several iterations of that loop. For every a G [A^], 
let Ya be the indicator random variable for the event that the ath ball reaching an active subpath 



of A decreased the number of active subpaths. In these notations, Proposition 3.11 stated that 

Va G [A^]. Pr[y„+i = 1 | ^i, . . . , FJ > 1 - p 
Let Y = Yl,a(^\N] ^a and let Z ~ Bin(N, I — p). Simple induction on A^ implies the following claim. 
Claim 3.12. \/k. Pr [Y > k] > Fi[Z > k]. 

Lemma 3.13. With probability at least 1 — 1/k^, after 70 log A; balls reach some active subpath of 
A, A has no active subpath. In addition, the height oflig^A is at most 35 log A;. 

Proof. Assume A^ = TOlog/c. Since whenever 1^ = 0, the number of active subpaths increases by 
at most 1, and whenever 1^ = 1, the number of active subpaths decreases by at least 1, if y > N/2, 
then A has no active subpaths. Therefore by the Chernoff Bound, 

Pr[^ has no active subpath after A^ balls reach A] > 

> Pr[y > N/2] > Pr[Z > N/2] > 1 - 1/k^ 

Next, consider some i > iq. li A has an active subpath at the end of the ith iteration of the outer 
loop, then at least two times during the ith iteration an active subpath of A is reached by a ball. 
After 35 log A; iterations of the outer loop, if A has an active subpath, then A^ > 70 log A;. By the 
same arguments as the first part of the proof, 

Pr[The height of Tio,A is at most A^/2] > Pr[y > A^/2] > Fi[Z > N/2] > 1 - 1/A;^ 

D 
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We denote by Ei the event that for every i, and every A ^ Ai, if H^a is a short subtree then 
after at most 70 log k balls reach an active subpath of A, A has no more active subpaths and in 
addition, the height of Ti^A is at most 35 log /c. 

Lemma 3.14. Pr[Ei] > 1 - l/k^. 

Proof. Fix some z, and A £ Ai. Assume that 7i,A is a short subtree. By definition, {i,A) has 
no short ancestor. For i' = logf,{i/p) < logf,{'D/p), P* itself is short, since V = £/p, and thus 
all tree nodes in level i' (and lower) are short. Therefore, i < i'. Since there are at most k 
nodes in every level of the tree, the number of short subtrees of T is at most k ■ logi,{'D/p) = 
k ■ (logftP + logft 100) < k^logk + 0{klogk) < 0{k^logk). By the previous lemma, and a union 
bound over all short subtrees, the result follows. D 

Since every node in level logi,{i/p) of the tree belongs to some short subtree, we get the following 
corollary. 

Corollary 3.15. With probability at least 1 — 2/k^, the height ofT is at most log^(^/p) + 301og A; < 
lOlogfeP. 

We denote by E2 the event that for all i < 10 log;, P and j G [k], the radius of the jth ball of 
the ith iteration of the outer loop is at most 2806*"*"^ log k. 

Lemma 3.16. Pr[E2] > 1 - 1/k^ 



Proof. Fix i < lOlog^P and j G [k]. Then Pr[i?*- > 86MogA;] < l/k^. By assumption 3.1 
lOlogftP = 0(logPlog/c) = 0(A;'^log A;). Thus by a union bound over all values of i and j in 
question, 

Pr[Vz,j. R^ < Sb^logk] > 1 - ^-^"Jf^^ >1-^ 

It follows that, with probability at least 1 — l/k"^, the radius of the jth ball of the ith iteration of 
the outer loop is at most Yli'<i ^^* i^S ^ — 2806*+-*^ log^ k. D 

Summing everything up, we can now bound with high probability the weights of all short 
subtrees of T. 

Claim 3.17. Conditioned on the events Ei and E2, for every iq > 1 and A G Ai^. If Ti^^A is a 
short subtree ofT, then the total weight charged to nodes ofTi^^A is at most 39200e6*'''^^ log k with 
certainty. 

Proof. Conditioned on Ei, at most 70 log A; detours are charged to nodes of every short subtree 
and for i = iq + 35 log A;, there are no more active subpaths of A. Conditioned on E2, the most 
expensive detour is of weight at most 5606*'''''^^ ^^"''^ log k, we get that the total weight charged 
to nodes of the subtree is 70 log k ■ 5606*0+^ • b^^'"^'' • log^ k < 39200e6*o+i log^ k. D 

3.3 Bounding The Weight Of T 

We are now ready to bound the total weight charged to the tree. We sum the weights in each level 
of the tree from the root down. In each level we sum the weights charged to long active subpaths, 
and the weights of short subtrees rooted in that level. More formally, for every i < log^(^/p), 
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denote by li the total weight charged to nodes of the form {i,A), where A €z Ai is a long active 
subpath. Denote by Sj the total weight charged to short subtrees rooted in the ith level of T. For 
every i > logfj{£/p), and every A £ Ai, A is short and therefore (i, A) belongs to some short subtree 
rooted in level at most logf,{£/p). Therefore, 

I0g6(^/P) 

Y^ wi,A = Yl (^^ + ^'^ (^) 

{i,A)er i=l 

Since i > 1, then the root of T, namely (1, P*), consists of a long path. Therefore, si = 0. We can 
therefore rearrange Equation (pi) to get the following. 

logb(^/p) 

{i,A)er j=i 

Let i < logf,{i/p). Let ^ G ^j be a long active subpath. That is w{A) > pU . Thinking of A as 
a continuous path, divide A into w{A)/[ph'^) segments of length pU . Some segments may contain 
no nodes. Let / be a segment of A, and assume / contains nodes (otherwise, no cost is charged to 
A on account of detours from /). Following the same arguments of propositions 3.9 and 3.11 we 
get the following result. 

Lemma 3.18. With probability at least 1 — 1/k^, no more than TOlog/c balls puncture I. 

Denote by E3 the event that for every i > logf,{i / k'^) , and for every long active subpath A £ Ai, 
in the division of A to segments of length p¥ , every such subsegment is reached by at most 70 log k 
balls. 

Lemma 3.19. Pr[E3] > 1 - 1/k^. 

Proof. Since for every i > log^(^/p), every A £ Ai is short, the number of relevant iterations (of the 
outer loop) is at most logi,{£/p) — logf,(£/A;^) = logb(/c^/p) < 0(log^ k). For every i > logf,(£/A;^) and 
every long path A S Ai, the number of segments of ^ is at most w{A)/{pb'^) < w{A)/{pi/k'^) < k'^/p. 
Therefore the number of relevant segments for all i > logf^{i/k'^) and for all long A £ Ai is at most 
O^k"^ log k) Applying a union bound over all relevant segments the result follows. D 

Since Pr[Ei] > 1 — 1/k^ and Pr[E2] > 1 — 1/A;^, we get the following corollary. 

Corollary 3.20. Pr[Ei A E2 A E3] > 1 - 0{k-'^) 

It follows that it is enough for us to prove that conditioned on Ei, E2 and E3, with probability 
1 the total weight charged to the tree is at most 0(log^ k)£. 

Lemma 3.21. Conditioned on E2 and E3, li < 5606*"^ ""^ A; log k. In addition, if i > logf,(^/A:^), 
h < 0(log A;) • i with probability 1. 

Proof. To see the first bound, observe that by the update process of P, at most k detours are 
added to P during the ith iteration. Conditioned on E2, each one of them is of weight at most 
5606*"^^ log k. To see the second bound, let A £ Ai be a long active subpath. The additional 
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weight resulting from detours from vertices of A is at most the number of segments of A of length 
pb^, times the additional weight to each segment. Therefore, the additional weight is at most 

WLA < w{A)/pb' ■ 70 log k ■ 5606^+^ log^ k = 392005/p • log^ k ■ w{A) . 

Since all paths in Ai are internally disjoint subpaths of P* , we get: 

li= Yl ^^'^^ E Oilog" k) ■ w{A) < 0{logH) ■ e 

long A^A^i long A^A^i 

D 
Lemma 3.22. Conditioned on events Ei, E2 and E3, Sj+i < 39200e6*"'"^/clog^ /c. In addition, if 

i >iogf,{i/e), si+i < o{iog^ k) ■ i. 



Proof. Conditioned on Ei and E2, we proved in Claim 3.17 that the total weight charged to a short 
subtree rooted in level i + 1 is at most 3920066*"*"^ log k with certainty. Since there are at most k 
such subtrees, the first bound follows. To get the second bound, note that by the definition of a 
short subtree, for every short subtree T' rooted at level i + 1, the parent of the root of T' consists of 
a long active subpath A of level i. Conditioned on E3, every segment of A is intersected by at most 
70 log /c balls. Therefore, {i,A) can have at most [w{A)/pU) ■ 70 log /c children, and in particular, 
children consisting of short active subpaths. The cost of a short subtree rooted in the i + 1 level of 
T is at most 39200e6*+^ log k. Thus the total cost of all short subtrees rooted in children of A is 
bounded by 

{w{A)/pb') • 70 log k • 39200e6^+^ log^ k < 0(log^ k) • w{A) 

Summing over all (internally disjoint) long subpaths of level i, the result follows. D 

We now turn to the proof Lemma |3.7[ 



Proof of Lemma 3.7, Since Pr[Ei A E2 A E3] > 1 — 0{k ), it is enough to show that conditioned 



on El, E2 and E3, the total weight charged to the tree is at most (lo g k)i with certainty. Recall 



that Y.{i,A)eT'^i,A = I^S (h + Sj+i). Following Lemmas 



3.21 



and 



3.22 



we get that 



J2 ('' + ^i+i) ^ Yl (5606*+ifc log2 k + 39/200e6'+2^ log^ k) 

i=l 8=1 

logMk^) 



(7) 
b i 



<0{klog^k) Y b' = 0{klog^k)-—---^ = o{l)-£ 



i=l 



In addition, 



logbWp) log^ie/p) 

Y ih + s^+l)< Yl oiiogHy + oiiog" k)e) 

iog^{e/k2)+i i=iog(,(f/fe2)+i 

< 0(log^ k)£ ■ ilog,{i/p) - log,ii/k^)) 

< 0(log^ k)e ■ 0(log^ k) = 0{log^ k) ■ e 
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(8) 



D 



Before describing the generalized algorithm, used to discard Assumption 3J_, we wish to note 
the following fact, which follows directly from the analysis of Algorithm [!} 

Remark 1. By slightly changing the analysis of Algorithm [^ (without changing the actual algo- 
rithm.) we can get that for every k > k, and every d> 3, the algorithm achieves a stretch factor of 
0{d'^ log^ k) with probability 1 — 0{k^). 

4 Terminal- Centered Minors: Extension to General Case 



In this section, we complete the proof of Theorem |1.3| by reducing it to the special case where 
Assumption 3.1 holds (which is proven in Section Isl). Let us outline the recursive algorithm for 



the general case. The algorithm first rescales edge weights of the graph so that minimal terminal 
distance is 1. li V < 2^ then we apply Algorithm [ij and we are done. Otherwise, we construct 
a set of at most k — 1 well-separated and low diameter balls whose union contains all terminals. 
Then, for each of the balls, we apply Algorithm [T] on the graph induced by that ball. Each ball 
is then contracted into a "super-terminal". We apply the algorithm recursively on the resulting 
graph G with the set of super-terminals as the terminal set. Going back from the recursion, we 
"stitch" together the output of Algorithm [T] on the balls in the original graph with the output of 
the recursive call on G, to construct a partition of V as required. The exact algorithm is described 



in Section 4.1 Before that, we need a few definitions. 

Assume that the edge weights are already so that the minimum terminal distance is 1. Denote 
by D the set of all distances between terminals, rounded down to the nearest powers of 2. Note 
that \D\ < k"^ . Consider the case 2? > 2 . There must exist < mo < k^ — k such that 
D n {2™o^ 2'"o+i, . . . , 2"^"+'=} = 0. Define R := {{x, y)eT^ : dcix, y) < 2™"}. 

Claim 4.1. R is an equivalence relation. 

Proof. Reflexivity and symmetry of R follow directly from the definition of a metric. To see that 
R is transitive, let x,y,z G T, and assume {x,y),{y,z) G R. Therefore dGix,y) < 2™° and 
dG{y,z) < 2'"o. By the triangle inequality, dG{x,z) < 2"'o+i_ Since L» n {2'"o, . . . , 2'^o+'=} = 0, 
dG{x,z) < 2"^°, and therefore (x,z) £ R. D 

For every equivalence class U G T/n, we pick an arbitrary u £ U, and define U = Bg{u, 2™°). 

Claim 4.2. U := {C^lt/GT/^ ^-5 a partial partition of V. Moreover, for every U G T/r>, U (^ tl , 
G\tj\ is connected and of diameter at most 2™"+-'^ < 2 . 

Proof. Let U G T /-n. Let n G C/ be such that U = Bg{u, 2"^" ). For every x G C/, by the definition of 
R, d(x, u) < 2"^° , and thus x £ U. Therefore U C.U. By the definition of a ball, G[U] is connected 
and of diameter at most 2™"°"*"^ < 2 . To see that W is a partial partition of V, take U' G T/tj such 

that U / U', and let u' G U' be such that U' = Bg{u',2'^o). Since {u,u') ^ R, dG{u,u') > 2"^°, 
andsinceDn{2'^o,...,2™o+''} = 0, dG{u,u') > 2""o+'=+^ thus f7 n [7' = 0. D 
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4.1 Detailed Algorithm 

Input: G = {V,E,w), T = {h, . . . ,tk} QV 
Output: A partition {Vi, V2, • • • , Vk} of V. 
1 



rescale the edge weights so that the minimal terminal distance is 1. 
if 2? := max„^^,g7- dciu, v) < 2^^ then 

run Algorithm [T| and return its output. 
else 

define R and U as above. 
for allU eU do 

run Algorithm 111 independently on G'[C'^]. 

contract [/ to a single "super-terminal", maintaining edge weights of all remaining edges, 
denote the resulting graph G. 

run Algorithm [2J recursively on G with the set of super-terminals. 
for all super-terminals u G G do 

for all vertices v assigned to u in the recursive call do 

assign v to its nearest terminal among the nodes contracted to u in line [8} 
return the resulting partition of V. 



Algorithm 2: Partitioning V - The General Case 

It is clear that Algorithm [2] returns a partition of V. In addition, since every level of recursion 
decreases the number of terminals in the graph, the depth of the recursion is at most k. During 
each level of the recursion, Algorithm [T] is invoked at most k times. Therefore, Algorithm [T] is 
invoked at most k"^ times, each time on a set of at most k terminals. By Remark [l| setting d = 5, 
we get that there exists Gq > 0, such that every time the algorithm is invoked, it achieves a weight 
stretch factor of at most Co log k, with probability at least 1 — 0{k~^). Applying a union bound, 
we get that with high probability, the stretch bound is obtained in all invocations of the algorithm. 
It remains to show that this suffices to achieve the desired stretch factor in G. 

Lemma 4.3. With probability at least 1 — 1/k, on a graph with t < k terminals, the algorithm 
obtains a stretch factor of at most Go log^ k + log^ k ■ 2^^ Y^k'<T 2(^')^ ^ 2Co log^ k. 

Proof. It is enough to show that conditioned on the event that every invocation of Algorithm [T] 
achieves a stretch factor of at most Co log k, the generalized algorithm achieves the desired stretch 
factor. We prove this by induction on k. For k = 2 the result is clear. Assuming correctness for 
every k < k, we prove correctness for k. Let s,t £ T. If (s, t) £ R, then s and t are in the same set 
in U, and by the conditioning, the stretch factor of the distance between s and t is at most Co log^ k. 
This does not change in steps 4 — 7 of the algorithm. Otherwise, Let s, t be the terminals (or super- 
terminals) associated with s and i in G respectively. Denote d = dG,w{s,t) and d = dQ^{s,i). 
Denote by G' = {V ,E' ,w') the terminal-centered minor induced by the partition returned by the 
recursive call, and by G" = {V" ,E" ,w") the terminal-centered minor induced by the partition 
returned in the final step of the algorithm. Denote d' = dG\w'is,t). and d" = dc ,w" {s , t) . Let u be 
a super terminal on a shortest path P' between s and t in C. Let P" be the path obtained from P' 
in G" in the following manner. In the place of every super-terminal u in P', originating in some node 
set U, we add a path between the corresponding terminals in U (based on the terminal-centered 
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minor constructed for G\U\ in step 3). The edges of P' are also replaced with corresponding edges 
in G". 

Recall that in G', the weight of every edge is the distance between its endpoints in G (by the 
definition of a terminal-centered minor). In G" the weight of every edge is the distance between 
its endpoints in G. Therefore the weight P' contained at most k — \ edges. In G" the weight of 
each such edge increases by at most /c2™o . In addition, every expansion of a super-terminal adds at 
most kT^° to the path. Therefore, w"{P") < w'{P') + 2k'^2""^ < d' + d ■ 2k'^2-^. By the induction 
hypothesis 



c^' < Co log*^ A: + log^ fc • 2-^= ^ {k'f]d. 

\ k'<k-l 

Since d < d, we get that 

d" < I Co log^ A: + log^ A; • 2-^= ^ 2ik'f\d + d-2k^2-'' 

\ k'<k-l J 

< Co log^ k + log*' k-2-''Y^ 2{k'f I d. 



k'<k 

D 
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